Simplifying the Analysis of Complex Survey Data Using the SAS
نویسندگان
چکیده
Large sample-based surveys often have complex sample designs, with design features including stratification, clustering, multi-stage sampling, and unequal probability of selection of observations. The calculation of the associated sampling weights often involves nonresponse adjustments and raking to external control totals. The analysis usually includes descriptive statistics such as frequencies, means, totals and their standard errors. Using standard statistical software modules such as PROC SUMMARY, PROC FREQ and PROC MEANS to analyze such data results in underestimation of variance, as these routines assume that the data is from a simple random sample and do not take into account the complex nature of the sample. Now, however, the survey analysis procedures such as PROC SURVEYMEANS and PROC SURVEYFREQ that have been added to SAS/STAT ® software can compute variances that accurately reflect complex sample design and estimation procedures. This paper compares the complexity of the variance estimation code used in earlier projects with the simplicity of the code that is possible using the survey analysis procedures.
منابع مشابه
An Introduction to Multiple Imputation of Complex Sample Data using SAS ®
This paper presents practical guidance on the proper use of multiple imputation tools in SAS® 9.2 and the subsequent analysis of multiple imputed data sets from a complex sample survey data set. Use of the MI and MIANALYZE procedures and SAS survey procedures for typical descriptive and inferential analyses is demonstrated. The analytic techniques presented can be used on any operating system a...
متن کامل133-2007: Alternatives for Analysis of Complex Sample Surveys: A Comparison of SAS®, SUDAAN®, and AM® Software
Data collected from complex sample surveys are frequently used in secondary data analysis. The analysis of such data using standard statistical software yields biased point estimates of parameters and incorrect standard errors as a result of unequal probabilities of selection of observations, stratification and clustering, and unit nonresponse. The appropriate analysis of data from complex samp...
متن کاملTransitioning to R : Replicating SAS , Stata , and SUDAAN Analysis Techniques in Health Policy Data
Statistical, data manipulation, and presentation tools make R an ideal integrated package for research in the fields of health policy and healthcare management and evaluation. However, the technical documentation accompanying most data sets used by researchers in these fields does not include syntax examples for analysts to make the transition from another statistical package to R. This paper d...
متن کاملPrinciples of Proper Inferences from Complex Survey Data
This paper discusses commonly-encountered features of complex survey datasets that should be accounted for by employing the SURVEY family of SAS® procedures. Concepts of clustering, stratification, unequal weighting, finite population corrections, and replication variances approximation methods new to SAS 9.2 will be presented. Potential pitfalls will be highlighted as will a few methods to tri...
متن کاملSAS/OR®: Introducing SAS® Simulation Studio
Discrete event simulation is used to model, study, plan, and improve systems in which random events play a dominant role. These systems are often driven by complicated mathematical and logical relationships, making it impossible to derive an analytical solution. The journal OR/MS Today cites simulation software as one of the most widely used tools in operations research, due in part to its appl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012